Metalinguistic Information Extraction for Terminology
نویسنده
چکیده
This paper describes and evaluates the Metalinguistic Operation Processor (MOP) system for automatic compilation of metalinguistic information from technical and scientific documents. This system is designed to extract non-standard terminological resources that we have called Metalinguistic Information Databases (or MIDs), in order to help update changing glossaries, knowledge bases and ontologies, as well as to reflect the metastable dynamics of special-domain knowledge.
منابع مشابه
Mining Metalinguistic Activity in Corpora to Create Lexical Resources Using Information Extraction Techniques: the MOP System
This paper describes and evaluates MOP, an IE system for automatic extraction of metalinguistic information from technical and scientific documents. We claim that such a system can create special databases to bootstrap compilation and facilitate update of the huge and dynamically changing glossaries, knowledge bases and ontologies that are vital to modern-day research.
متن کاملMetalinguistic Information Extraction from Specialized Texts to Enrich Computational Lexicons
متن کامل
Explotación computacional del metalenguaje en corpus especializados para la generación de lexicones no convencionales
This paper presents the application of automatic analysis (of statistical and symbolic nature) for the detection and processing of metalanguage in highly technical texts from various domains. The selective metalinguistic information extraction performed by the MOP system allows compilation of non-conventional lexicons to aid domain-restricted NLP.
متن کاملCorpus-based terminology extraction applied to information access
This paper presents an application of corpus-based terminology extraction in interactive information retrieval. In this approach, the terminology obtained in an automatic extraction procedure is used, without any manual revision, to provide retrieval indexes and a “browsing by phrases” facility for document accessing in an interactive retrieval search interface. We argue that the combination of...
متن کاملFrom Terminology Extraction to Terminology Validation: An Approach Adapted to Log Files
Log files generated by computational systems contain relevant and essential information. In some application areas like the design of integrated circuits, log files generated by design tools contain information which can be used in management information systems to evaluate the final products. However, the complexity of such textual data raises some challenges concerning the extraction of infor...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/cs/0504074 شماره
صفحات -
تاریخ انتشار 2004